【周末特辑】9月第3周最火AI论文 | 群智RL提速大模型;小VLA零预训练控机械
Update: 2025-09-14
Description
本期的 5 篇论文如下:
[00:40 ] TOP1(🔥455) | 🤝 Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing(共享即关爱:基于集体RL经验共享的高效大模型后训练)
[03:19 ] TOP2(🔥163) | 🤖 VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model(VLA-Adapter:面向小型视觉-语言-动作模型的有效范式)
[05:44 ] TOP3(🔥156) | 🤔 Why Language Models Hallucinate(语言模型为何产生幻觉)
[07:57 ] TOP4(🔥139) | 💡 Reverse-Engineered Reasoning for Open-Ended Generation(面向开放式生成的逆向工程推理)
[10:35 ] TOP5(🔥131) | 🧠 A Survey of Reinforcement Learning for Large Reasoning Models(大型推理模型的强化学习综述)
<figure>
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
Comments
In Channel